CDS
Accession Number | TCMCG075C24747 |
gbkey | CDS |
Protein Id | XP_017981049.1 |
Location | join(16825695..16826081,16826629..16827060,16828997..16829555,16831109..16831404) |
Gene | LOC18592971 |
GeneID | 18592971 |
Organism | Theobroma cacao |
Protein
Length | 557 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018125560.1 |
Definition | PREDICTED: probable alkaline/neutral invertase D isoform X1 [Theobroma cacao] |
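The Location and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates in the `join(...)` string are 1-based and inclusive (the usual convention for feature locations) and that the final codon of the CDS is a stop codon. The helper name `spliced_length` is hypothetical and not part of any database tool.

```python
import re

def spliced_length(location: str) -> int:
    """Sum the inclusive lengths of every start..end segment in a location string."""
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in segments)

# Location string copied from the CDS record above.
location = ("join(16825695..16826081,16826629..16827060,"
            "16828997..16829555,16831109..16831404)")

cds_nt = spliced_length(location)      # total spliced CDS length in nucleotides
codons, remainder = divmod(cds_nt, 3)  # remainder 0 means a whole number of codons
protein_aa = codons - 1                # assumption: the last codon is a stop codon

print(cds_nt, remainder, protein_aa)   # 1674 0 557
```

The four exons sum to 1674 nt, which is 558 codons; dropping the stop codon gives 557 residues, in agreement with the stated protein length.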
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGATGGGACTAAAGAGATGGGACTTAGAAATGTGAGCTCAACCTGCTCAATTTCCGAAATGGATGATTATGATCTGTCACGCCTTCTTAACAAGCCAAAGCTTAACATAGAGAGGCAAAGATCATTTGATGAGAGGTCACTAAGTGAGCTCTCTATTGGTCTCACTAGAGGAAGCTATGACAATTATGAGACCACCCACTCGCCTGGTGGGAGGTCAGGTTTTGATACTCCGGCTTCATCAGCTAGAAATTCCTTTGAACCTCACCCCATGGTGGCTGAAGCATGGGAAGCTCTCAGGAGATCATTGGTGTATTTCAGAGGCCAACCCGTTGGTACCATTGCCGCATATGATCATGCTTCTGAGGAAGTTTTGAACTATGATCAGGTTTTTGTTCGAGATTTTGTACCCAGTGCTCTGGCTTTTCTGATGAATGGAGAGCCTGAGATAGTTAAGAACTTCCTCTTGAAGACCCTACAACTTCAAGGGTGGGAGAAAAGAATAGATAGATTCAAGCTAGGGGAAGGTGCAATGCCAGCTAGCTTCAAAGTGCTTCATGATCCTGTACGTAAAACAGACACAATTATTGCAGATTTTGGAGAGAGTGCCATTGGACGAGTTGCTCCAGTTGACTCTGGATTTTGGTGGATAATTCTGCTCCGTGCATATACAAAATCTACCGGGGATTTATCTCTTGCGGAGACACCTGAGTGTCAAAAAGGAATGAGGCTCATACTTACTCTGTGTCTATCAGAAGGATTTGATACATTCCCAACCCTACTTTGTGCTGATGGATGCTCTATGATTGATCGAAGAATGGGTATTTATGGTTATCCTATTGAAATTCAAGCACTTTTCTTTATGGCGTTGAGGTGTGCTTTATCAATGCTGAAGCATGATGCAGAAGGAAAAGAGTGCATTGAAAGAATTGTAAAGCGTTTGCATGCCTTGAGTTATCACATGCGCAGTTACTTTTGGCTTGACTTTCAACAACTAAATGATATTTACAGATATAAAACTGAGGAATATTCTCACACAGCAGTAAATAAGTTTAATGTTATTCCTGATTCAATTCCTGACTGGGTATTTGATTTTATGCCAACACGAGGTGGCTACTTTATTGGCAATGTTAGTCCTGCAAGGATGGATTTCCGATGGTTTTGTTTAGGTAACTGTATAGCAATCCTATCTTCTCTTGCAACTCCAGAGCAATCAATGGCTATAATGGACCTTATTGAAGCCCGTTGGGATGAGCTTGTTGGAGAAATGCCTTTAAAAATAGCTTATCCTGCAATAGAAAGTCATGACTGGCGAATTGTCACTGGTTGTGACCCTAAGAACACGAGATGGAGTTATCACAATGGAGGATCCTGGCCAGTGCTTTTGTGGTTGCTAACTGCTGCTTGCATCAAGACGGGAAGACCACAAATTGCAAGACGAGCTATTGATCTTGCTGAGACACGTTTGCTGAAAGATAGCTGGCCAGAATATTATGATGGCACACTTGGGAGATTTATTGGTAAACAGGCTCGGAAGTATCAGACATGGTCAATAGCAGGATATTTAGTGGCAAAAATGATGCTAGAGGATCCGTCTCACTTGGGGATGATTTCTCTGGAAGAGGACAAGCAGATGAAGCCATTGATAAAGAGATCATCTTCTTGGAATTGCTAA |
Protein: MDGTKEMGLRNVSSTCSISEMDDYDLSRLLNKPKLNIERQRSFDERSLSELSIGLTRGSYDNYETTHSPGGRSGFDTPASSARNSFEPHPMVAEAWEALRRSLVYFRGQPVGTIAAYDHASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVLHDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALSMLKHDAEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPDWVFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPEQSMAIMDLIEARWDELVGEMPLKIAYPAIESHDWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAETRLLKDSWPEYYDGTLGRFIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQMKPLIKRSSSWNC |
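The relationship between the CDS and Protein lines can be spot-checked by translating the two ends of the CDS. This is a sketch only, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 21 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATGGGACTAAAGAGATGGGACTTAGA"  # first 30 nt of the CDS above
cds_end = "TCATCTTCTTGGAATTGCTAA"             # last 21 nt of the CDS above

print(translate(cds_start))  # MDGTKEMGLR
print(translate(cds_end))    # SSSWNC
```

Both fragments match the corresponding ends of the Protein line (`MDGTKEMGLR...` and `...SSSWNC`), and the CDS terminates in the stop codon `TAA`.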